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The coronavirus nucleocapsid (N) protein is a viral RNA-binding protein with multiple functions in 
terms of virus replication and modulating cell signalling pathways. N protein is composed of 
three distinct regions containing RNA-binding motif(s), and appropriate signals for modulating 
cell signalling. The subcellular localization of severe acute respiratory syndrome coronavirus 
(SARS-CoV) N protein was studied. In infected cells, SARS-CoV N protein localized exclusively 
to the cytoplasm. In contrast to the avian coronavirus N protein, overexpressed SARS-CoV N 
protein remained principally localized to the cytoplasm, with very few cells exhibiting nucleolar 
localization. Bioinformatic analysis and deletion mutagenesis coupled to confocal microscopy and 
live-cell imaging, revealed that SARS-CoV N protein regions | and Ill contained nuclear localization 
signals and region Il contained a nucleolar retention signal. However, cytoplasmic localization 
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was directed by region Ill and was the dominant localization signal in the protein. 


Severe acute respiratory disease coronavirus (SARS-CoV) is 
a nidovirus. One of the most abundant viral proteins pro- 
duced inside a coronavirus-infected cell is the nucleocapsid 
(N) protein, a multifunctional phosphoprotein (Calvo et al., 
2005; Chen et al., 2005; Jayaram et al., 2005), which can bind 
viral RNA with high affinity (Chen et al., 2005) to form a 
ribonucleocapsid structure (Risco et al., 1996). SARS-CoV 
N protein (SARS-CoV-N) can also regulate cellular processes 
(He et al., 2003; Luo et al., 2005; Surjit et al., 2004). Based on 
amino acid sequence comparisons, three conserved regions 
have been identified in the coronavirus N protein (Parker & 
Masters, 1990), which may serve as RNA-binding sites and 
be involved in modulating cell signalling. 


Coronavirus and the closely related arterivirus N proteins 
can localize to the cytoplasm, the nucleus and the nucleolus 
(Hiscox et al., 2001; Ning et al., 2003; Rowland et al., 1999; 
Tijms et al., 2002; Wurm et al., 2001), and can bind/interact 
with nucleolar proteins (Chen et al., 2002; Yoo et al., 2003). 
The nucleolus is a dynamic subnuclear structure that is 
involved in ribosome subunit biogenesis and in the control 
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of cell growth (Andersen et al., 2005; Lam et al., 2005). 
Morphologically, the nucleolus can be divided into a fibrillar 
centre (FC), a dense fibrillar component (DFC) and an outer 
granular component (GC) (Thiry & Lafontaine, 2005). Whilst 
nucleolar localization of N protein has been described for 
arterivirus and coronaviruses, the potential nuclear/nucleo- 
lar localization of the SARS-CoV-N is unknown. The cellular 
localization of SARS-CoV-N may inform us to its potential 
function in modulating cell processes. 


To investigate the distribution of SARS-CoV-N in the con- 
text of virus infection, Vero cells were infected with SARS- 
CoV with an m.o.i. of 0-5 in a P3 facility following EU 
biosafety regulations, and fixed at 16, 24, 48 and 72 h post- 
infection (p.i.) using 8% paraformaldehyde for analysis by 
indirect immunofluoresence with a rabbit polyclonal serum 
specific for SARS-CoV-N coupled with an Alexa 488- 
conjugated anti-rabbit immunoglobulin G. The nucleus was 
stained with ToPro3 (Molecular Probes). The experiment 
was repeated four times and representative images are 
shown in Fig. 1(a). The reason for choosing the later time 
points was that Qinfen et al. (2004) reported virus-like 
particles in the nucleus of Vero E6 cells infected with SARS- 
CoV at 72 h p.i. Our study indicated that whilst N protein 
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Fig. 1. (a) Confocal microscopy of the localization of SARS-CoV-N (green) at 16, 24, 48 and 72 h in cells infected with 
SARS-CoV, the nucleus is stained blue. For comparison, examples are also given for 16 and 72 h time points without staining 
of the nucleus. (b) Live-cell imaging of cells expressing EGFP, ECFP, EGFP-IBV-N or ECFP-SARS-CoV-N, several 
representative examples of the latter two are shown. Nucleoli are indicated by arrows. (c) Confocal microscopy of cells 
expressing either EGFP-IBV-N (green) or ECFP—SARS-CoV-N (blue) at 24, 48 and 72 h post-transfection. Nucleoli are 
indicated by arrows. Histogram of the percentage of cells expressing either EGFP-IBV-N (empty column) or ECFP—SARS- 
CoV-N (filled column), which exhibit cytoplasmic and nucleolar localization. The data are derived from 10 fields of view taken 
from three replicate experiments. (d) Confocal microscopy of cells expressing either EGFP (green) or ECFP (green) and cells 
transfected with both pECFP-SARS-CoV-N (false colour red) and pEGFP-IBV-CoV-N (green) at 24, 48 and 72 h post- 
transfection. Co-localization, where it occurs, is shown in yellow and nucleolus is indicated by an arrow. 
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localized to the cytoplasm at all time points, no nuclear or 
nucleolar localization was observed. Given the low m.0o.i. 
used, the infection was not synchronised, and the earlier 
time point of 16 h p.i. may represent cells infected between 
16 and 10h p.i. (this latter time point due to cells being 
infected by progeny virus of the first round of infection). 


Whilst no nucleolar localization of SARS-CoV-N was 
observed in infected cells, the successful detection of 
nucleolar proteins using antibodies can be related to the 
concentration of the protein within the nucleolus (Sheval 
et al., 2005), in that the nucleolus is not always refractive 
to antibody staining. In addition, charged proteins can also 
migrate through cells post-fixation and become localized 
to the nucleus (Lundberg & Johansson, 2001, 2002). To 
address these concerns, and to investigate the subcellular 
localization of SARS-CoV-N in more detail, we generated 
vectors that expressed the protein (or parts of the protein) 
as a fusion with either enhanced cyan or green fluorescent 
protein (ECFP or EGFP, respectively). The protein could 
then be detected by direct fluorescence using both live-cell 
and confocal microscopy. As a control, we compared the 
subcellular localization of SARS-CoV-N with Infectious 
bronchitis virus N protein (IBV-N), which was located in 
either the cytoplasm alone or the cytoplasm and nucleolus 
(Hiscox et al., 2001). 


SARS-CoV N gene was cloned downstream of ECFP (vector 
pECFPC1; Clontech), creating pECFP-SARS-CoV-N, and 
IBV N gene was cloned downstream of EGFP (vector 
pEGFPC2; Clontech), creating pEGFP-IBV-N, and when 
expressed in cells, resulted in fluorescent fusion proteins 
ECFP-SARS-CoV-N and EGFP-IBV-N, respectively. To 
examine the subcellular localization of SARS-CoV-N, 
pECFP-SARS-CoV-N and pEGFP-IBV-N were transfected 
(using Lipofectin) into either Vero or Cos-7 cells, and sub- 
cellular localization was followed using live-cell imaging at 
24 h post-transfection. As a control, cells were transfected 
with vectors expressing either ECFP or EGFP alone 
(pECFPC1 and pEGFPC2, respectively). The data indicated 
that whilst ECFP and EGFP localized predominately to the 
cytoplasm and the nucleus, the majority of ECFP—SARS- 
CoV-N localized to the cytoplasm, but, in a few cases, also to 
apparent subnuclear structures and the nucleus, whereas 
EGFP-IBV-N localized to either the cytoplasm or the cyto- 
plasm and nucleolus (Fig. 1b). No difference in localization 
of either protein was observed between Vero and Cos-7 cells 
(data not shown), and therefore for subsequent analysis we 
used Cos-7 cells, as transfection was more efficient in this 
cell type. Similarly, in line with our previous results the 
position of a fluorescent tag at either the N terminus (in this 
present study) or C terminus of N protein did not affect 
localization of the fusion protein when compared to native 
protein (Hiscox et al., 2001; data not shown). 


The number of cells exhibiting nucleolar localization 
with ECFP-SARS-CoV-N appeared significantly less in 
comparison to EGFP-IBV-N. Therefore, we examined 
the subcellular localization of ECFP-SARS-CoV-N with 


EGFP-IBV-N over a 120 h period. Cells were transfected 
with either pECFP-SARS-CoV-N or pEGFP-IBV-N. In order 
to accurately determine the distribution of both proteins, 
cells were fixed at each time point and visualized using 
confocal microscopy. Representative confocal images for 
the viral proteins are shown in Fig. 1(c) for 24, 48 and 72 h 
post-transfection. EGFP-IBV-N localized to either the cyto- 
plasm alone or the cytoplasm and nucleolus, with a maxi- 
mum of 40% of transfected cells exhibiting this phenotype 
at 48 h. In contrast, ECFP-SARS-CoV-N localized pre- 
dominately to the cytoplasm alone with a maximum of 10 % 
of transfected cells, showing both cytoplasm and nucleolar 
localization at 72 h post-transfection. Localization of pro- 
teins to the nucleolus at this time point could be an artefact, 
in that the cells have undergone several rounds of division 
and nuclear and nucleolar assembly. No difference was 
observed in the subcellular localization of both coronavirus 
N proteins using either live-cell imaging or fixing cells and 
visualizing using confocal microscopy. Therefore, this latter 
technique could be used to determine subcellular localiza- 
tion of the N protein in combination with marker proteins. 


One hypothesis to account for the difference in nucleolar 
localization was that ECFP-SARS-CoV-N prevented entry 
or retarded entry of proteins into the nucleus. For example, a 
number of RNA viruses that replicate in the cytoplasm can 
alter the nuclear pore complex to prevent either import 
and/or export of proteins and RNAs to the nucleus (Gustin, 
2003; Gustin & Sarnow, 2001, 2002). To investigate this, we 
co-transfected cells with pECFP-SARS-CoV-N and pEGFP- 
IBV-N, cells were then fixed at 24, 48 and 72 h, and the sub- 
cellular localization of these fusion proteins was examined 
using a Zeiss LSM510 Meta confocal microscope such 
that the ECFP signal was unmixed from the EGFP signal. 
Representative cells are shown in Fig. 1(d) (note that in 
dual-transfected cells ECFP has been falsely coloured red). 
The data indicated that in dual-transfected cells, EGFP—-IBV- 
N localized to the cytoplasm and nucleolus with the same 
frequency as in singly transfected cells (data not shown), and 
that ECFP-SARS-CoV-N did not prevent the entry of a 
similar protein into the nucleus. 


Another possibility to account for the difference in sub- 
cellular distribution between IBV-N and SARS-CoV-N was 
that ECFP-SARS-CoV-N disrupted nucleolar architecture, 
therefore preventing or reducing its own localization to the 
nucleolus. A number of viruses and viral proteins can 
disrupt nucleolar architecture (Hiscox, 2002, 2003), includ- 
ing IBV-N, although this did not prevent IBV-N from 
localizing to the nucleolus (Chen et al., 2002). In addition, 
the previous experiment indicated that SARS-CoV-N did 
not inhibit localization of IBV-N. However, to investigate 
this possibility in more detail we examined cells that 
expressed ECFP-SARS-CoV-N by using confocal micro- 
scopy. Different nucleolar regions can be dissected using a 
combination of nucleolar marker proteins and appropriate 
fluorescent tag. B23 (DS-red) and nucleolin (EGFP) can be 
used to distinguish the FC and DFC, and EGFP-fibrillarin 
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can be used to distinguish the GC (Fig. 2a). In addition, 
phase-contrast microscopy can be used to visualize the FC, 
as this appears less refractive than the surrounding nucleolar 
regions (Fig. 2a). 


Cells were co-transfected with pECFP-SARS-CoV-N and 
vectors that direct the expression of EGFP-—nucleolin, 
EGFP-fibrillarin and DS-red-B23. The confocal microscope 
was used to take 1-12 um sections through cells expressing 
EGFP and ECFP tagged N proteins to visualize the nucleolus 
and the FC. Bright field microscopy (Fig. 2b) and the 
nucleolar markers (EGFP-—nucleolin and DS-red-B23) 
(Fig. 2c and d, respectively) indicated that the FC was 
present in cells expressing ECFP-SARS-CoV-N. These 
markers also showed that there was no difference in the 
distribution of EGFP-nucleolin and DS-red-B23 between 
mock-transfected cells and those cells expressing ECFP— 
SARS-CoV-N. The data indicated that whilst EGFP- 
fibrillarin was punctate in appearance in untreated cells 
(Fig. 2a), it appeared more evenly distributed in the DFC in 
cells co-expressing ECFP—SARS-CoV-N and EGFP-fibril- 
larin (Fig. 2e). Redistribution and/or interaction with fibril- 
larin has been described previously for both coronavirus and 
arterivirus N proteins (Chen et al., 2002; Yoo et al., 2003). 


Trafficking of proteins to the nucleus can be directed by 
appropriate nuclear localization signals (NLSs) including 
pat4, pat7 and bipartite motifs, which are groupings of 
arginine and lysine residues (Macara, 2001). Export of 
proteins from the nucleus is mediated via nuclear export 
signals (NESs) perhaps best characterized by CRM-1- 
dependent signals, which share a leucine-rich motif 
(Macara, 2001; Ossareh-Nazari et al., 2001). In contrast, 
nucleolar retention/localization signals (NoRSs/NoLSs) are 
not well characterized and there is no consensus sequence 
(Carmo-Fonseca et al., 2000). Marra et al. (2003) suggested 
that SARS-CoV-N contained a pat7 NLS in region III 
located between amino acid residues 369 and 376, and 
predicted that potential nuclear localization of SARS-CoV- 
N could account for the unique pathogenesis of the virus 
compared with other coronaviruses. However, corona- 
viruses representative of three known groups also contain 
potential NLSs in region III (Hiscox et al., 2001; Wurm et al., 
2001), indeed the pat7 motif identified in SARS-CoV-N is 
most similar to the pat7 motif located in region II] of IBV-N. 
Using bioinformatic analysis (la Cour et al., 2004; Nakai & 
Horton, 1999), we found that SARS-CoV-N contained both 
pat7 and pat4, and bipartite NLSs, as well as a potential 
CRM-1-dependent NES (Fig. 3a). In contrast, IBV-N con- 
tained two overlapping pat4 and pat7 motifs and a potential 
CRM-1-dependent NES in region III (data not shown). A 
number of cellular and viral proteins have multiple NLSs, 
which regulate their nuclear localization (Burich & Lei, 
2003; Haffar et al., 2000; Luo et al., 2004; Stelz et al., 2002; 
Tsai & McKay, 2005), and therefore we predicted that SARS- 
CoV-N may contain signals that modulate its localization. 


We hypothesized that SARS-CoV-N regions I (aa 1-156) 
and III (aa 300-422) would direct nuclear localization, with 


region III being the most pronounced as it had three 
potential NLSs, and region IH (aa 157-299) exhibiting 
nuclear and possibly cytoplasmic localization, as this region 
also contains a potential NES motif (Fig. 3a). To test this 
hypothesis, we cloned the regions downstream of ECFP, 
creating pECFP-SARS-CoV-RI, pECFP-SARS-CoV-RII and 
pECFP-SARS-CoV-RIII, whose expression would lead to 
the synthesis of ECFP tagged to SARS-CoV-N regions I, II 
and II, respectively. The proteins were visualized using live- 
cell and fixed imaging, and to visualize nucleoli proteins 
were imaged by confocal microscopy and staining with 
propidium iodide (Fig. 3b). ECFP-SARS-CoV-RI localized 
predominately to the nucleus with some cytoplasmic 
localization, but not to the nucleolus. ECFP—SARS-CoV- 
RII localized predominately to nucleoli and ECFP—SARS- 
CoV-RIII localized to the cytoplasm, cytoplasm and nucleus, 
and cytoplasm and nucleoli. The most frequent being 
cytoplasmic localization. Although these fragments contain 
predicted NLSs, alternatively nuclear/nucleolar localization 
of these proteins could be directed by their interaction with 
other cellular proteins, which contain appropriate NLSs/ 
NoRSs. For example, nucleolar targeting of hepatitis 6- 
antigen is mediated through binding to nucleolin (Lee et al., 
1998). 


Given that SARS-CoV-N region II contains a predicted NES, 
but was retained in the nucleolus, we examined the potential 
of this motif to direct nuclear export of EGFP. Aa 220-240, 
encompassing the predicted NES, were cloned downstream 
of EGFP. Expression of this plasmid in transfected cells 
indicated that EGFP—NES was non-functional (Fig. 3c) and 
had a similar localization pattern to EGFP (Fig. 1d). Given 
that SARS-CoV-N contains at least five potential NLSs, we 
would have predicted that deletion of aa 228-234 (encom- 
passing the predicted NES) in the full-length SARS-CoV-N 
would have resulted in ECFP-SARS-CoV-N being localized 
to the nucleus. However, ECFP—SARS-CoV-N with the NES 
deletion (ECFP—N-ANES) remained localized to the cyto- 
plasm (Fig. 3d), suggesting the presence of an alternative 
nuclear export/cytoplasmic retention signal. 


Therefore, we tested the ability of the potential double pat7/ 
bipartite nuclear localization located in region HI to direct 
ECFP to the nucleus and found that localization of this 
protein was entirely nuclear (Fig. 3e) compared with ECFP 
alone (Fig. 1c), indicating that this motif was a functional 
NLS. Given that ECFP-SARS-CoV-RIII localized to the 
cytoplasm, this region may therefore contain a dominant 
cytoplasmic retention/nuclear export signal that acts on 
the whole protein. Certainly, treatment of cells expressing 
ECFP-SARS-CoV-N with 8 ng leptomycin B (Sigma- 
Aldrich) ml~' did not reveal any retention of ECFP- 
SARS-CoV-N in the nucleus (data not shown), unlike 
similar experiments with an arterivirus N protein (Tijms 
et al., 2002). 


Our data indicates that overexpressed SARS-CoV-N can 
localize to the cytoplasm and nucleolus, but with signi- 
ficantly less efficiency than IBV-N. No nucleolar localization 
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Fig. 2. (a) Markers for different regions of the nucleolus. Bright field (BF) microscopy for the FC, EGFP-nucleolin (Nuc), DS- 
red-B23 (B23) for the FC and DFC, and EGFP-fibrillarin (Fib) for the GC. Cells expressing ECFP-—SARS-CoV-N examined 
by BF microscopy (b) and confocal microscopy using selected fluorescent tagged markers, EGFP-—nucleolin (c), DS-red-B23 
(d) and EGFP-fibrillarin (e) to visualize distinct regions of the nucleolus. Optical sections (shown as 1-12 jum increments) 


were taken to visualize the FC (arrows). Nucleoli are electronically enhanced. 
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Fig. 3. (a) Amino acid sequence of the SARS-CoV-N with potential pat4 and pat7 NLSs boxed, the potential bipartite NLS is 
underlined and the potential NES is outlined by the hashed box, arginine and lysine residues are shown in bold face, and a 
putative bipartite NLS is shown in italics. Below this are schematic representations of the constructs used to investigate the 
distribution of localization signals in the SARS-CoV-N. All constructs were sequenced and their expression confirmed by 
Western blot (data not shown). (b) Live-cell and confocal images (individual channel plus merged), showing the distribution of 
ECFP-SARS-CoV-RI, ECFP-SARS-CoV-RIl and ECFP-SARS-CoV-RIll (several examples of which are given). Pl, Propidium 
iodide. Live-cell images of EGFP—NES (c), ECFP—N-ANES (d) and ECFP-NLS (e) are shown. 
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of SARS-CoV-N was observed in virus-infected cells, and 
the difference between this and overexpression maybe due to 
a number of factors including the ability of antibodies to 
penetrate the nucleolus, association of N protein with other 
viral proteins or modification of the protein. The data 
suggests that region II contains a NoRS, and that whilst 
regions I and III contain NLSs, in the case of region III this is 
overridden by an undefined non-CRM-1 NES/cytoplasmic 
retention motif, which may be the dominant signal for 
localization of SARS-CoV-N inside the cell. Conformational 
change of the protein, perhaps through differential phos- 
phorylation or cleavage, may expose the NLSs/NoRS with 
low efficiency. 
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